﻿ The Right Frontier Constraint Holds Unconditionally Dan Cristea “Alexandru Ioan Cuza” University of Iaşi, Faculty of Computer Science Romanian Academy Iaşi branch, Institute for Theoretical Computer Science dcristea@infoiasi ro text corresponds to the span of text ana- Abstract lysed, in the same linear order 1 The paper re-opens the controversial case The Right Frontier Constraint (RFC) is men- of the right frontier constraint A formal tioned in discourse in two settings: as an attach- proof of its applicability in models that ment constraint in an incremental discourse use certain types of trees as modalities to development of the tree structure (for instance, represent the structure of discourse is (Cristea and Webber, 1997)), and as a referential given Then its violation, reported to oc-constraint defining the regions of the discourse cur in discourses “under pressure”, is re-model taken to be in focus, therefore introducing visited and a proposal for discourse discourse entities which are recoverable by refer- structure representation that deviates from ential expressions contained in the last mentioned classical trees is made discourse unit (Polanyi, 1988; Webber, 1991) RFC has recently received new attention among scholars preoccupied to model the discourse phe- 1 Introduction nomena (Asher and Lascarides, 2003; Sassen and There are two situations when one can associate a Kühnlein, 2005), in relation to the discovering of data structure type with a process intended to put counter-examples in corpora of chat communica- in evidence the structure of the discourse (as a dis-tion course parser): at any moment during the interpre-In this paper, a formal proof on the applicability tation of the discourse and at the end of the of RFC over discourse trees that observe the SP is interpretation process If the interpretation of the given (section 2) and a proposal of enhancement of discourse structure is modelled to be synchronous the representation by replacing trees with graphs is with a cognitive process (like reading or listening) made (section 3) we have an incremental type of discourse process- ing One fundamental property of the discourse 2 Why cannot the Right Frontier Con- trees that is supposed to hold in both the interme-straint be violated? diate and the final situations of the tree as a repre-In both settings mentioned above, RFC is empiri- sentation of the structure is the Sequentiality cally defended based on observations over linguis- Principle (SP), (for instance, (Marcu, 2000)) It tic material However, it is easy to formally prove states that: this discourse property once one agrees to operate A left-to-right reading of the terminal frontier of a discourse tree associated to a 1 Many authors suppose this principle holds in most cases The problems are given by the interrupted sentences In most anno- tation conventions applied to discourse structure (see, for in- stance, the RSTTool of Michael O’Donnell (1997), the constituent parts are linked in chains using attributes like Same-Unit, etc ) only with discourse trees observing the SP The lowest levels of the tree (therefore which do not formal prove follows below as the natural yield of have any daughters) a sequence of propositions In discussing the in-Adjunction (see Figure 1a) operates on two cremental development of a discourse tree, two trees: a developing tree (DT) produced so far dur- operations are considered, usually mentioned by ing parsing (called “initial tree” in TAG) and an researchers dealing with incremental discourse auxiliary tree (AT), which is being contributed by parsing: adjunction and substitution, inspired by the current minimal chunk of text under process- tree adjoining grammars (TAG) (Joshi, 1987) ing The auxiliary tree is "inserted" in the DT at the In processes of this type it is clear that the final level of a certain node of DT The auxiliary tree discourse tree is obtained after the last step of the owns a special node placed on its terminal frontier, in Figure 1 and process and that, at any intermediate moment, a called foot node, and marked ✳ partially developed discourse tree reflects the in-following, whose role is to take over the sub-tree terpretation that the listener has on the discourse of the initial tree "cut" by the operation of adjunc- heard so far There is, to my knowledge, little cor-tion An AT whose foot node is placed in the left pus evidence for a “garden path” behaviour at the extremity of its terminal frontier will be called left- level of discourse understanding that would impose footed auxiliary tree The TAG restriction that the to the incremental model to accommodate a way of root and the foot nodes of the AT have the same reconsidering the structure build so far However, a label does not apply here This implies that none of certain buffering mechanism able to integrate new the formal properties that were proved to hold for material onto the already developed structure in TAGs are supposed to hold in our setting Only the greater chunks than the minimum text span used to cutting and insertion machinery is kept developing tree transformed tree developing tree auxiliary tree transformed tree Y elementary tree X X Y X ✳ Ø substitution node node Xfoot a b Figure 1 Adjunction (a) and substitution (b) represent a leaf tree node would always allow for a smoother advancement towards the final structure Substitution (see Figure 1b) is the operation by Şoricuţ and Marcu (2003) present corpus evidence which a tree, called elementary tree, replaces a in that in 95% of the cases there is always a tree node node, called substitution node, and noted Ø covering a sentence, which suggests to consider the Figure 1b, placed on the right extreme of the ter-2 dimension of the sentence as the advancing buffer minal frontier of the DT Our argument applies to discourse trees at any In what follows all nodes are supposed to be step during an incremental developing process We marked with a label A sequence of nodes is also will give first some definitions represented by the sequence of their labels First, The right frontier (RF) of a tree is the sequence we will give without proof some evident properties of nodes that starts in the root of the tree and con-of tree adjoining tinues with all nodes placed in the right extreme of the tree at any level The terminal frontier (TF) is the sequence of nodes, counted left-to-right, on the 2 Although mentioned here, we will ignore in this study the use of substitution Proposition 1: If the root node of a tree τ has as Proposition 3: In any span of text σ whose dis- daughters the nodes a1, …, an (in this order, from course structure is the tree τ , any node of its right left to right) and the TF frontier of the tree with frontier covers a right substring of σ (see Figure root ai is the sequence of nodes σ i, for each 3b) i=1 to n, then the TF of tree τ is σ …° σ (see The announced Right Frontier Property can now 1 °n Figure …), where “°” is the string concatenation be proved operation The Right Frontier Property: At any step of an incremental discourse parsing, if the DT does not contain substitution nodes, the SP is observed if and only if all operations are adjoinings of left- a1 … an footed ATs onto nodes of the right frontier of the DT Proof: If the adjoining is realised with a left-footed 1 … σn σAT on the right frontier of the developing tree, then the SP is observed The property results by Figure 2 Adjacency of siblings is reflected as virtue of Proposition 3 As can be noticed on adjacency of covered spans Figure 4, the TF of the resulting tree is σ’ = σ ° σ, in accordance with the SP 0 °1a Proposition 2: If an AT with the terminal fron-σ tier σ ✳ ° σ is adjoined at the position of the We will show that any adjunction operation 1 °2 other than that of a left-footed AT on a node of the root of a DT having the terminal frontier σ 0, then the terminal frontier of the resulting tree is RF of the DT contradicts the SP There are two σ σ ° σ (see Figure 3a) cases to consider: 1 °02 X τ X σ 0σ σ 12 σ 2 1 σ ✳ σ σ 0 a b Figure 3 Adjoining properties in trees a ’a τ τ ✳σ a a1 σ 0 a 1 σσ 0σ σ 1 Figure 4 Adjunction of a left-footed AT on the RF of the DT  the adjunction of an AT which is not left-pressure condition is not satisfied, therefore if we footed on the RF of the DT, and are under pressure, the RFC does not apply  the adjunction of an AT of any kind in a Lets take one of their dialogues and complement node of the DT which does not belong to its it with more text at the beginning: RF Case a follows from both Proposition 2 and Example 1 ’ τ a τ τ’ aτ a a1 a1 a σσσ σ a2a1a1a2a1 a1 ✳ ✳ a1 a2σ σ a1σ a2σ 2σ 0 σ σ12 0σσ 1 σ 0 σ σ0 σ 1 σ 1 a b Figure 5 Cases that result in violation of the SP Proposition 3 (notice the notations on Figure 5a): by adjoining the AT (with the TF σ ✳ ° σ) on a1 B: I’ve recently felt confused a1 °a2 a node a1 (covering the TF σ 1) belonging to the when seeing Maria talking with Mi- right frontier of the DT, the TF modifies from chael σ σ to σ ° σ ° σ ° σ, thus violating the SP advise is that I’llb1 have B: myThe degree reason in I’m asking your 0 °10a11a2 The correct order would have been σ σ ° σ ° June and right now I’m not in my best 0 °1a1 σ a2, while now the nodes of σ1 are intercalated be-shape tween σ a1 and σa2 a2 A: oh yea, then, what we have Case b follows from a recursive application of here is a nice case of jealousy Propositions 1 and the properties of string concate-a3 B: would you say that I’m in- nations (notations in Figure 5b): if the adjoining of deed only jealous? c1 B: do you have another consul- an AT (with the TF σ ✳ ° σ) is done on a node tation hour scheduled soon? a1 °a2 a1 of the DT (covering the TF σ 1), then the TF de-b2 A: Try to be with yourself af- velops from σ σ ° σ to σ ° σ ° σ ° σ° σ, ter all you’re heading for your final 0 °120a11a22 which also violates the SP, the correct order having degree! to be σ σ ° σ ° σ ° σ b3 B: mhm, well, but concentrating 0 °12a1a2 on my studies often doesn’t work 3 On the violation of the RFC c2 a4 A:A: yes,only alwaysjealous on that Monday would al- Violation of the RFC has been recently reported by readythere sufficeis more asto problem,it but I think some researchers, as Asher and Lascarides (2003) and Sassen and Kühnlein (2005), after studying Starting from a3, Example 1 displays the same corpora of chat communication Sassen and Küh-dialogue as in their corpus I have marked the three nlein explain this finding by the pressure to keep separate threads (themes) with the indices a, b, c the channel open or to signal presence on chat dia-and numbered them in sequence Suppose now that logue, as opposed to face-to-face dialogue or the dialogue would go on like this: monologue They complement the classical RFC with a conditional, expressing the absence of pres-A: to this you have to find a solu- sure By doing this, however, we are left with wild tion immediately possibilities of conducting dialogues in cases when pressure exists Indeed they say: if the absence of At this point this would have been understood as communication threads, the corresponding DTs either adding to B’s problems his recent jealousy, merge Consider the example: following the thread a, or that concentrating on B’s studies doesn’t work, following the thread b There Example 2 is no way however in which this could have been ov? understood to mean the meetings between Maria a1 A: how was your trip to Braş and Michael (in the sense to intervene for putting b1 A: oh, did I tell you? I have an end to them) or not being in B’s best shape seen yesterday evening Michael in the What I’m trying to convince the reader is that in disco b2 did you meet him anymore since you two split? r a2 B Fantastic! lot’s of nice guys and lot of fun r1 ab wow, but, you know, he was r1 too in that trip! The two evolving trees before ab are sketched in 1 a2 a1 a2 aFigure 6a, while the resulting graph after the merge r2 – in Figure 6b r2 ab At times, the dialogue can evidence different threads, all evolving from a common trunk Con- b1 b2 b1 r sider the following chat: Example 3 b2 ab ab1 A: so, you didn’t know that I a b finished with Michael?! ab2 it happened last month af- ter I came back from Mexico Figure 6 Merging of two DTs a3 B: oh, I’m sorry, are you still mourning after him or you have communications that fit the “pressure” pattern, as already someone else? those displayed by chats, different themes could be a4 A: negative! I’m ok and need a interleaved, while, however, each of them do still period of loneliness observe some kind of RFC restrictions which seem a5 B: you cannot resist long like to apply to individually developing trees It seems this I know you… that the human mind has the capacity to record and there? b3 so, have you seen the pyra-mids follow more communication threads in parallel, in each of them the discourse unfolding by respecting It is not unusual that a dialogue communicates the basic principles of well-formedness3 A model ideas or situations which, although different, are for this behaviour can again be expressed by the tightly interleaved, configuring, therefore, just one old tree structures observing the SP and the RFC, communication thread In such a dialogue it is also with the only change that each communication possible that, at a certain moment, two separate thread develops its own tree themes diverge In Example 3, ab1 and ab2 do still Two operations can apply over discourse trees belong to the same thread, as signalled by the ana- developing in parallel: merging and splitting phoric pronoun it in the second sentence which is When one AT adjoins simultaneously onto two (or anchored in the first sentence Then a3-a5 develop more) nodes belonging to different but parallel on the separation of A from Michael, while b3 re- fers back to A’s trip to Mexico, making evident the yield of a separate thread However, each of the two threads still obey to the RFC behaviour Put in 3terms of discourse structures, this means that a de- I thank to an anonymous reviewer who noticed that in case of chats the memory of the conversation could be helped also veloping tree structure suffers a split at a certain by the history displayed on the screen r2 r1 r3 r2 b2 a3r4r3 r1 ab1 ar1 a4r45r5 ab1 ab2 ar1 ab1 a3 a4 a5 ab1 r5 ab2 b3 a b ab2 b3 Figure 7 Splitting a DT onto two separate threads point, allowing for asynchronous ulterior un-essing, accommodate the observation that, at any folding, as in Figure 7 moment during interpretation of the discourse, the Figure 7a shows the two ways in which the ini-hearer can access, using weak referential means, tial tree made of ab1 and ab2 can evolve In the only certain parts of the past discourse, while upper tree, a3 is adjoined at the top most node r1 others are obscured In accordance also with other (since it elaborates on the separation from Michael researchers (Gundel et al , 1993), by weak referen- narration communicated by the sequence ab1-ab2), tial means I understand referential signallers that while the following nodes, a4 and a5, are each ad-point at previously introduced discourse entities by joined at the bottom most nodes on the right fron-describers displaying not-at-all or poor semantic tier of their DTs (since both complement with features, as pronouns or demonstratives Their use more details their previous turns, on the same ”Mi-relies on the sufficiency of the focus mechanism as chael” story) In the lower tree, b3, detailing the a delimitating tool4 In contrast, rich referential Mexico trip mentioned in ab2, is adjoined at this means are referential expressions that individuate last node In both trees the roots and the edges are an entity among many in terms of properties of the marked in distinguished patterns These markings individual having to be singled out5 They can be will allow computing two different right frontiers proper nouns and noun phrases that exhibit a lexi- on the graph of Figure 7b, obtained after splitting cal and/or an ontological relation with the name of the evolving tree structure onto two separate the pointed discourse entity In (Grosz and Sidner, threads In practice, in an incremental development 1986), the focusing effect in the dynamics of dis- of the discourse structure, the graph, as displayed course interpretation is modelled by a stack of at- in Figure 7b, is built incrementally tentional states, the newest ones appearing on top The problem with this approach is that, at the of the older ones In (Cristea et al , 1998) the very moment of splitting the tree onto a graph, the RFC seems to be violated Indeed, how can we 4 t forms and colors, explain the adjoining of b3 onto the ab2 node, which Imagine one a pool of objects of differenamong should be indicated, and a lantern beam whish is closed at the moment when b3 is uttered moving above them and, so, individuating in turn, one or an- (see the upper tree in Figure 7a)? other In this situation a verbal expression may sufficiently I will comment on this apparent contradiction by signal only the moment when the pointing should be consid- bringing into focus those theories of the relation-ered, the individuation of one element among many being made by the focussing beam itself ship between discourse structure and referentiality 5 This is always the case when the object to be indicated is which, modelling the dynamics of discourse proc-outside the lantern beam or there are more than one object lightened by the spot focussing effect is obtained by considering part of a tree that obeys the Principle of Sequentiality by the vein expression, computed on the base of the its terminal nodes, adjoining, as an operation of discourse structure, which is made of elementary incremental tree development, should always be discourse unit the referring expressions belong to applied onto a node belonging to the right frontier Remember also that there is a nuance in the Examples are given that demonstrate that even in formulation of the RFC: most people recognise cases of intertwined chats the RFC still applies if that, if the principle is violated, the obtained dis-considered on individual thematic threads We course is less readable, and not that it is completely propose to replace discourse trees by discourse unacceptable Sometimes, when RFC is violated graphs on which RFC can again be verified First, we feel like we need to put to work some more intertwined different themes are represented by inferential machinery in order to understand the distinct trees, each of them obeying the RFC Sec- discourse Some other time, the discourse fails In ond, during discourse unfolding, when different the example above we do not have this latter case: themes, representing distinct dialogue or mono- the dialogue can be understood, apparently even logue threads, approach, their corresponding trees without the need to bounce back in order to local-can merge in a discourse graph in which common ise the referent for the pyramids there The reason nodes are represented uniquely Third, from one is that the referential expression the pyramids is a developing tree structure, a graph could yield, at very powerfully evoking one It resonates instantly moments when the theme diverges onto two sepa- and unambiguously with Mexico from the previous rate threads The splitting is announced by power- discourse If, instead, the last turn would have been ful evoking referential means which do not find how often have you been there? the discourse their corresponding antecedents on the right fron- would be perceived as failed, because no link tier could have been found with an entity contributed by the nodes on the RF and the question contains References nothing powerful enough to evoke unambiguously an entity in the rest of the discourse Nicolas Asher and Alex Lascarides 2003 Logics of So, the conclusion is that discourse trees can Discourse Cambridge UP split in points localised by powerful evoking Dan Cristea, Nancy Ide, and Laurent Romary 1998 means in past discourse Veins Theory: A Model of Global Discourse Cohesion and Coherence Proceedings of the 36th Annual Meeting 4 Conclusions of the ACL and of the 17th COLING (COLING/ACL’98) In discourse processing, the Right Frontier Con-Dan Cristea and Bonnie L Webber 1997 Expecta- straint plays a prominent role at least from two tions in Incremental Discourse Processing Proceedings perspectives: a theoretical one, by offering expla-of the 35th Annual Meeting of the Association for Com- nations for the choices humans make in using lin-putational Linguistics, Madrid guistic pointers, as pronouns or demonstratives, Barbara Grosz and Candace Sidner 1986 Attention, Intention and the Structure of Discourse Computational and a computational one, by offering a strong con-Linguistics, 12 dition which significantly reduces the indetermi-Jeanette Gundel, Nancy Hedberg, and nism in processing Recently, evidence has been Ron Zacharski 1993 Cognitive Status and the Form of found in chat corpora, which militates apparently Referring Expressions Language, 69 against this constraint In these examples, dia-Aravind Joshi 1987 An Introduction to Tree Adjoin- logues display intertwined turn-taking, which seem ing Grammars In A Manaster-Ramer (ed ) Mathemat- to minimize the RFC behind a condition of appli-ics of Language J Benjamins Publishing House cability (Sassen and Kühnlein, 2005) However, Daniel Marcu 2000 The Theory and Practice of Dis- the introduction of this condition leaves “pressure” course Parsing and Summarization The MIT Press dialogues completely unconstrained, which is un-Cambridge, Massachusetts Michael O'Donnell 1997 RST-Tool: An RST realistic The paper rehabilitates the applicability Analysis Tool Proceedings of the 6th European Work- of the Right Frontier Constraint in all cases of dis-shop on Natural Language Generation March 24 - 26, course interpretation, by formally proving that 1997 Gerhard-Mercator University, Duisburg whenever the discourse structure is represented as Livia Polanyi 1988 A formal model of the structure of discourse Journal of Pragmatics, 12 Claudia Sassen and Peter Kühnlein 2005 The Fron- tier Constraints as Conditional In Computational Lin- guistics and Intelligent Text Processing, Proceedings of the 6th International Conference, CICLing 2005, Mexico City, Springer Radu Soricuţ and Daniel Marcu 2003 Sentence Level Discourse Parsing using Syntactic and Lexical Information Proceedings of the Human Language Technology and North American Association for Com- putational Linguistics Conference (HLT/NAACL), May 27-June 1, Edmonton, Canada Bonnie L Webber 1991 Structure and Ostension in the Interpretation of Discourse Deixis Natural Lan- guage and Cognitive Processes, 6(2) 